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Abstract 

Background: The lipopolysaccharide (LPS) is the major immuno-dominant antigen of all Legionella species 
including L pneumophila. Its diversity is the basis for the classification of L pneumophila into serogroups and 
monoclonal subgroups and is thought to be involved in strain specific virulence. The understanding of the 
genetic basis of the LPS-antigen is incomplete. Thus, we analyzed the genetic locus involved in LPS-biosynthesis 
of L pneumophila serogroup 1 (Sg1) strains with the focus on strain specific gene composition. 

Results: The LPS-biosynthesis loci of 14 L pneumophila Sg1 strains comprise two distinct regions: A 15 kb region 
containing LPS-biosynthesis genes that can be found in all L pneumophila strains and a Sgl -specific 18 kb region. 
The 15 kb region is highly conserved among Sgl strains as reflected by high homologies of single ORFs and by a 
consistent ORF arrangement. In contrast, the Sgl specific 18 kb region is variable and partially disrupted by phage 
related genes. We propose that the region spanning from ORF 6 to ORF 1 1 of the Sgl -specific region is likely 
involved in late LPS-modification. Due to the high variability of this small region and various combinations of single 
ORFs within this region a strain specific LPS-structure could be synthesized including modifications of legionaminic 
acid derivates. 

Conclusions: Our data clearly demonstrate that the gene structure of the LPS-biosynthesis locus of L pneumophila 
Sgl strains show significant interstrain variability. These data can be used for further functional analysis of the LPS 
synthesis to understand pathogenesis and reactivity with monoclonal antibodies. Moreover, variable but strain 
specific regions can serve as basis for the development of novel genotyping assays. 
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Background 

Legionella pneumophila is one of 56 described species be- 
longing to the genus Legionella of the family Legionellaceae 
[1]. These Gram-negative bacteria are ubiquitous inhabi- 
tants of natural and manmade aquatic environments where 
they survive parasitically in protozoa like amoeba [2,3] and 
in community structures such as biofilms [4,5]. Addition- 
ally, Legionella can infiltrate the human lung via inhaled 
aerosols [3,6] and subsequently infect alveolar macrophages 
[7] which frequently cause a potential fatal pneumonia 
termed Legionnaires' disease (LD) [8]. L pneumophila 
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strains belonging to the serogroup 1 (Sgl) were predomin- 
antly reported in LD cases, especially in community ac- 
quired and travel-associated cases [9,10]. 

Lipopolysaccharide (LPS) is the major immuno- 
dominant antigen of all Legionella species including 
L. pneumophila [11]. It is the main component recog- 
nized by patients sera and by diagnostic assays in urin- 
ary antigen detection [12]. The LPS molecule possesses a 
high degree of diversity and thereby provides the basis for 
the classification of L. pneumophila into serogroups and 
subgroups by monoclonal antibodies (mAb) [13-15]. Sgl 
strains are subdivided into nine mAb-subgroups using the 
Dresden monoclonal antibody panel (Table 1) [16]. 

The structural characterization of LPS of L. pneumophila 
identified several specific chemical attributes which differs 
it from the LPS molecules of other Gram-negative 
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Table 1 Monoclonal antibody based subgrouping of 
L pneumophila Sgl strains using the Dresden panel 



Monoclonal 8/5 3/1 3 8/4 10/6 20/1 26/1 30/4 

antibody subgroup 



Philadelphia 


+ 


+ 


+ 






Allentown/France 


+ 


+ 


- 




+/ _a 


Benidorm 


+ 


+ 




+ 




Knoxville 


+ 


+ 


+ - 






OLDA 


+ 




+ 


+/ _a 


+ +/-° 


Oxford 


+ 




+ 




+/ _a 


Heysham 


+ 




+ - 






Camperdown 


+ 








+ + 


Bellingham 


+ 




- - + 


+ 


+ + 


a Variable results with different strains. 








bacteriareviewd 


in 


[17]. 


Particularly 


the 


O-antigen 



homopolymer structure consists of an unusual residue, 5- 
acetamidino-7-acetamido-8-0-acetyl-3, 5, 7, 9-tetradesoxy 
-D-glycero-D-galacto-nonulosonic acid (legionaminic acid) 
and its derivates [18-20]. 

A central step in understanding the correlation of the 
LPS structure and pathogenesis of L. pneumophila was 
the description of the genetic background of LPS mole- 
cules by Luneberg and colleagues [21], More precisely, a 
genetic locus composed of at least 28 open reading 
frames (ORF) is essential in LPS core oligosaccharide 
biosynthesis and LPS O-chain biosynthesis. The genes of 
this 31-36 kb cluster have characteristic functions re- 
quired for the synthesis, transport, translocation and 
modification of LPS components. The lag-1 gene of this 
biosynthesis locus encodes for an O-acetyltransferase 
which is responsible for the 8-O-acetylation of legio- 
naminic acid [22], Strains carrying a functional lag-1 
synthesize an LPS epitope that reacts with the mAb 3/1 
(initially named mAb 2 [23]) of the Dresden monoclonal 
antibody panel. This epitope is assumed to contribute to 
an increased virulence [22,24] since mAb 3/l + strains 
represent the most prominent subgroup of clinical Le- 
gionella isolates. In contrast, strains lacking lag-1 carry 
mainly deacetylated LPS molecules. These mAb 3/1" 
strains comprise only a small number of clinically identi- 
fied L. pneumophila strains in immunocompetent pa- 
tients [9,10]. 

Besides the mAb 3/1 specific O-acetylation of the 
legionaminic acid epitope, to date it remains elusive how 
strain specific mAb-reactivities can be explained. In- 
creased understanding of the genetic background and 
structural LPS properties of the different Sgl strains 
could help to comprehend subgroup distributions 
among clinical and environmental isolates [9,16,25-27] 
and would deliver more insight in the role of LPS in the 
L. pneumophila life cycle. 



To achieve this goal, we analyzed the LPS -biosynthesis 
loci of at least one member of each mAb-subgroup (ex- 
cluding mAb-subgroup Oxford) of the L. pneumophila 
Sgl. In this study we focused on the genetically compos- 
ition of the loci and putative genotype-phenotype corre- 
lations according to the Dresden panel of mAbs. 

Results and discussion 

Two regions within the LPS-biosynthesis locus 

To gain insight into the genetic composition and ar- 
rangement of the LPS biosynthesis locus we analyzed 
the loci of 14 L. pneumophila Sgl strains. The strains 
represent members of all mAb-subgroups that can be 
distinguished by the Dresden monoclonal antibody panel 
(Table 1) besides the extremely rare mAb-subgroup 
Oxford. The LPS biosynthesis loci of five strains 
were newly sequenced for this study. These were: 
Camperdown 1 and Heysham 1 of the rarely found sub- 
groups of the same name [9,25] and the strains Uppsala 
3, Gorlitz 6543 and L10/23. Eight LPS biosynthesis loci 
were obtained from complete genomes that have been 
published previously. Furthermore, for strain RC1 (mAb 
subgroup OLDA) the biosynthesis locus was available as 
well (Table 2). 

The LPS-biosynthesis locus of each of the analyzed 
L. pneumophila Sgl strains contained at least 28 ORFs 
and ranged in size from 30,644 bp (strain Lorraine) to 
35,888 bp (strain 130b) with an average locus size of 
33,398 bp respectively. The average ORF size within the 
locus was approximately 1 kb. The complete LPS- 
biosynthesis locus had a slightly lower % GC content (~ 
35%) than the adjacent regions (~ 38%) and the total 
genome (~ 38.5%), respectively. 

Structural and comparative analysis of the loci con- 
firmed a highly conserved 15 kb region from wecA (ORF 
14) to lpg0748 (ORF 28) according to the Philadelphia 
genome as shown previously [34]. Additionally, all ORFs 
within this region were consistently orientated into the 
same direction (Figure 1A and B). 

A second region within the locus of 18 kb in size is 
spanning from lpg0779 (ORF 1) to lpg0764 (ORF 13). 
Here, the structural composition and the orientation of 
ORFs as well as the total amount of putative ORFs was 
less conserved. Interestingly, many transposases and phage 
related genes were present in 8 strains (Figure 1A). 

The heterogeneous nature of the 18 kb region and the 
extremely high conserved 15 kb region found in our 
study are largely in agreement with earlier results. These 
proposed to separate the locus into a Sgl specific and a 
L. pneumophila specific region [34,35]. Microarray 
analysis of Sgl and non-Sgl strains have identified a 
13 kb region (ORF 16-28) which is present in all 
L. pneumophila strains and a 20 kb region (ORF 1-15) 
generally found in all Sgl strains [34]. The two regions 
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Table 2 LPS biosynthesis loci obtained from sequenced genomes of L pneumophila Sgl strains 



Strain 


mAb subgroup 


Accession no. 


Reference 


Alcoy 2300/99 


Knoxville 


GenBank: NC_014125.1 


[28] 


Corby 


Knoxville 


GenBank: NC_009494.2 


[29] 


L10/23 (Ulm)* 


Knoxville 


EMBL: HF545881 


this study 


Uppsala 3* 


Knoxville 


EMBL: HE980445 


this study 


Paris 


Philadelphia 


GenBank: NC_006368.1 


[30] 


Philadelphia 1 


Philadelphia 


GenBank: NC_002942.5 


[31] 


HI 0604 1 D^S 


Rpllinnh^m 


FMRI ■ FOQSR71 1 


pzj 


Gorlitz 6543* 


Bellingham 


EMBL: HF678227 


this study 


Camperdown 1* 


Camperdown 


EMBL: HE980447 


this study 


Heysham 1* 


Heysham 


EMBL: HE980446 


this study 


130b (Wadsworth) 


Benidorm 


EMBL: FR687201 


[33] 


Lens 


Benidorm 


GenBank: NC_006369.1 


[30] 


Lorraine 


Allentown 


EMBL: FQ958210 


[32] 


RCf 


OLDA 


EMBL: AJ277755 


[21] 



* only LPS biosynthesis locus sequenced. 

were defined based on the LPS -biosynthesis loci of the 
Sgl strain Paris [30]. 

To determine the putative breakpoint between both 
regions is difficult. However, based on our analysis of 
the structural composition we would rather separate the 
LPS biosynthesis locus between lpg0763 (ORF 13) and 
wecA (ORF 14). This is in agreement with recent data, 
since the genes wecA (ORF 14) and galE (ORF 15) were 
demonstrated to be present in non-Sgl strains with 
lower amino acid similarities when compared to Sgl 
strains (55-61%) [35]. 

The initially mentioned ORF 13 is located next to the 
breakpoint region. In total, four different types of ORFs 
were found in the analyzed region of Sgl strains here 
named ORF 13-a, -b, -c and -A. In each of the strains 
Lens, 130b, HL 06041035 and Gorlitz 6543 two ORFs 
were found. These strains carried a putative conserved 
protein of unknown function (here referred to as ORF 
13-A). A transposase-disrupted ORF 13-A was present 
in strain 130b (Figure 1A). Additionally, the strains 
carried an ORF which shared features of the radical 
S-adenosylmethionine (SAM) superfamily (CDD: 
cd01335) named ORF 13-c (Additional file 1: Table S2). 
Interestingly, all these strains lacked the ORF 12. How- 
ever, even though the strain Lorraine lacked ORF 12 as 
well, it carried only a single ORF 13-A variant. 

A distinct ORF of unknown function with amino acid 
similarity to ORF 13-A of only 38%, here named ORF 
13-a, was present in the remaining strains with the ex- 
ceptions of a truncated form in strains RC1, Philadelphia 
1 and Paris. Philadelphia 1 and Paris shared high similar- 
ities with ORF 13-a but a deletion led to a frame shift 
resulting into three smaller fragments (pooled as ORF 
13-b) (Table 3). 



A conserved region found in all serogroup 1 strains 

Within the conserved region several genes were found 
which are proposed to be involved in the biosynthesis of 
the highly acetylated core region which is composed 
of mannose, N-acetyl-glucosamine (GlcNAc), N-acetyl- 
quinovosamine (QuiNAc) and rhamnose residues [19]. A 
vast number of ORFs, more specifically ORF 21 through 
25 and 28, were recently reported to facilitate the biosyn- 
thesis of the repetitive legionaminic acid residues of the 
O-antigen [18,36]. The pyrodoxal-phosphate dependent 
aminotransferase (ORF 21), the acetyltransferase neuD 
(ORF 22) and a dehydratase (lpg0966) located outside of 
the locus are likely to synthesize the precursor molecule of 
legionaminic acid, UDP-AJAT-diacetylbacillosamine (UDP- 
Bac2Ac4Ac) [37]. Contradictory to our findings, functions 
of the neuD products are described highlighting that the 
acetyltransferase is involved in Lag- 1 -independent O- 
acetylation of few legionaminic acid residues close to the 
LPS-core of L. pneumophila [21,38,39]. Furthermore, the 
adjacent genes encoding for NeuC (ORF 23), NeuB (ORF 
24) and NeuA (ORF 25) were recently identified to be 
involved in the subsequent biosynthetic processes conver- 
ting UDP-Bac2Ac4Ac into CMP-5,7-diacetamido-3,5,7, 
9-tetradeoxy-D-^/ycero-D-^/o-nonulosonic acid (CMP- 
Leg5Ac7Ac) [36]. Moreover, the ORF 28 is homologous to 
the ptmG gene of Campylobacter jejuni (Q1324) which 
converts the CMP-Leg5Ac7Ac residue to CMP-5- 
acetamidino-7-acetamido-3,5,7,9-tetradeoxy-D-^/ycero-D-^ 
^/o-nonulosonic acid (CMP-Leg5Am7Ac) [40], the dom- 
inant residue of the O-antigen of non-Sgl strains of 
L. pneumophila [41]. A functional correlation of the ORFs 
of this region is supported by recent transcriptomic data 
of strain Paris in which the ORFs 21-17 and 28-22 were 
transcribed as operons [42]. Since all analyzed Sgl strains 
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A: 

Alcoy 2300/99, Corby, L10/23 (Knoxville) 

Orf1 K Orf2 j ag -i Orf 3 . wzt A wzm . Orf 6 a Orf 7 Orf 8 Orf9 K Orf 10 K Orf11 K Orf 12 K .Orf 13-a 

^ -^^s^-: — :> — N— > X=>- 

Uppsala 3 (Knoxville) 

Orf 1 Orf 2. Jag-1 L * ,Orf3 A wzt .wzm. Orf 6 t Orf 7 .Orf 8 Orf 9 K Orf 10 ' Orf 11 k Orf 12 K .Orf 13-a 

Paris, Philadelphia 1 (Philadelphia) 

Orfl Orf 2 K Jag-1 k * A Orf3 A wzt wzm A Orf 6 . Orf 7 Orf 8 Orf ft Orf 10. Orf 11 K Orf 12 K A Orf13-b 

^>Q4m>fMK^^z^< i^ss-^==>^=>ai»i==><j<,^ 

Lens (Ben idorm) 

Orfl Orf 2 lag-1 Orf 3 A wzt wzm A Orf 6 Orf 7 Orf 8 Orf 9 Orf 10 Orf 11 , Orf 13-c Orf 13-/ 

- X >jm:< 4-//.^< k^\\xxns-=|h :>: z> :>#<|s-<. 

130b (Ben idorm) 

Orfl Orf 2 Jag-1 Orf 3 A wzt wzm Orf 6 . Orf 11 Orf 10 Orf 9 Orf 8 Orf 7 . Orf 13-c J Orf 13-A 

^^^>4rm^< \+mt -K — < — :-te^^s> f ^ feia;^— t:::: 

Lorraine (Allentown) 

Orfl Orf 2 Jag-1 Orf 3 A wzt wzm . Orf 6 . Orf 7 Orf 8 Orf 9 Orf 10 Orf 11 Orf 13-A 
RC1 (OLDA) 

Orfl Orf 2 Orf 3 A wzt wzm A Orf 6 . Orf 7 Orf 8 Orf 9 Orf 10 K Orf 11 Orf 12 k Orf 13-a 
HL 06041 035 (Bellingham) 

Orf1 K Orf 2 Orf 3 A wzt wzm Orf 6 . Orf 7 A Orf 8 K Orf9 K Orf 10 Orf 11 Orf 13-c Orf 13-A 

^X=X^, i^C==^ssssgs| 111 =^^==>^^v-"-"> 

Gorlitz 6543 (Bellingham) 

^ Orf 1 ^Orf 2^ ^-^ ^ Orf 3^ ^z^^^^tt^ Q r f6 ^ Orf 7 ■ i ^^ M ^^ M ^^^ M ^^f^^ ^ 0rf °^ * ^V^/^fr ^ 13A 

Camperdown 1 (Camperdown) 

Orft Orf 2 Orf 3 A wzt wzm Orf 6 Orf 7 Orf 8 Orf 9^ Orf 10 Orf 11 Orf 12 . A Orf 13-a 

^^^^< ^ ^ss^^>^>^ 

Heyshaml (Heysham) 

Orfl 0rf2 Orf3 , wzt wzm , Orf6 Orf7 Oris Orf9 K Orf10 K Orfll Oil 12 Orf13-a 

-^=>^-<^< ^ 4^>^^==h — \. : — ^ — ^>— ^ — n^; 

B: 

All Sg1 strains 

^wecA A 93lE jm\A gp\ rmlB rjnID rpilC yvfE ijpuD neuC jrieuB n jpuA h jsH bjsF oseA-like 

Orf 14 Orf 15 Orf 16 Orf 17 Orf 18 Orf 19 Orf 20 Orf 21 0rf~22 Orf 23 Orf 24 Orf25 0rf26 0rf27 Orf 28 

1 kb 

Figure 1 Structural representation of the LPS-biosynthesis locus. Shown are the LPS-biosynthesis loci of 14 L pneumophila Sg 1 strains and 
the corresponding monoclonal subgroup (in brackets). Strains Alcoy 2300/99, Corby and LI 0/23, and Paris and Philadelphia 1, respectively had 
the same genetic structure and monoclonal subtype and were therefore shown in one scheme. The numbering of ORFs was adopted by [21]. 
A: shows the Sg 1 -specific 18 kb region (ORFs 1-13) and B: shows the 15 kb region (ORFs 14-28). The direction of transcription is indicated by 
arrowheads. The filled black arrows indicate transposases/phage-related proteins. Grey shades and hatched patters serve to distinguish ORFs. 
Asterisk in Uppsala 3, Philadelphia 1 and Paris represents a partial ORF 2 duplication (ORF 2 like) as described by [46]. Underlined ORFs 7-1 1 in 
strain 130b represent an inversion. Gorlitz 6543 carries a truncated lag-1 marked with + . 



Table 3 Amino acid similarity of the L pneumophila Sgl specific LPS-biosynthesis region from Ipg0769-lpg0761 (ORF 1 - ORF 15) of strain Philadelphia 1 to 
other Sgl strains 



Amino acid similarity [%]* 



Gene name of L pneumophila 


Philadelphia* 




Knoxville# 




Benidorm# 


Bellingham# 


Allentown# 


OLDA# 


Camperdown# 


Heysham# 


Sgl strain Philadelphia 1 


Paris 


2300/99 Alcoy 


Corby 


Uppsala 3 


Ulm 


130b 


Lens 


HL0604 1035 


Gorlitz 6543 


Lorraine 


RC1 


Camperdown 1 


Heyshaml 


Ipg0761 (galE) 


ORF 15 


100 


100 


100 


100 


100 


97.1 


96.0 


99.8 


99.8 


99.8 


100 


100 


100 


Ipg0762 (wecA) 


ORF 14 


100 


99.5 


99.5 


99.5 


99.5 


93.4 


93.1 


93.7 


93.7 


93.4 


98.8 


99.5 


99.5 


lpl0803 A 


ORF 13 


- 


40.3 


40.3 


40.3 


40.3 


trans. c 


100 


98.2 


98.2 


96.6 


41.8 


40.3 


40.3 


Ipg0765 


ORF 12 


100 


98.6 


98.7 


98.6 


98.6 


- 


- 


- 


- 


- 


98.7 


98.6 


trans. c 


Ipg0766 


ORF 11 


100 


96.6 


96.6 


96.6 


96.6 


93.2 


93.2 


93.7 


93.7 


93.1 


96.6 


96.6 


96.6 


\pg0767 


ORF 10 


100 


96.2 


96.2 


96.2 


96.2 


96.6 


97.1 


98.9 


98.9 


97 


95.6 


96.2 


96.2 


Ipg0768 


ORF 9 


100 


30.6 


30.6 


30.6 


30.6 


98.4 


99 


99 


99 


98.9 


99.4 


30.6 


30.6 


Ipg0769 


ORF 8 


100 


31 


31 


31 


31 


97.9 


97.4 


98.4 


98.4 


97.4 


100 


31 


31 


\pg0770 


ORF 7 


100 


90.6 


90.6 


90.6 


90.6 


32 


31.9 


31.9 


31.9 


99.8 


99.9 


90.6 


90.6 


Ipg0771 


ORF 6 


100 


38.8 


38.7 


38.7 


38.7 


38.8 


99.1 


100 


100 


38.8 


38.6 


99.1 


38.7 


\pg0772 (wzm) 


ORF 5 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 


Ipg0773 (wzt) 


ORF 4 


100 


99 


99.6 


100 


100 


100 


99.6 


100 


99.5 


99 


99.8 


100 


100 


Ipg0774 


ORF 3 


100 


91.6 


86.4 


98.7 


92.1 


89 


86.4 


100 


86.4 


91.6 


99.5 


99.8 


99.8 


Ipg0775 a 




100 






100 




















Ipg0776 b 




100 






100 




















\pg0777 (lag-1) 




100 


96.8 


94.9 


100 


96.8 


94.9 


94.9 




94.7 + 


96.8 








Ipg0778 


ORF 2 


100 


97.9 


97.4 


100 


97.7 


97.4 


97.4 


99.6 


96.5 


97.9 


98.9 


98.7 


98.7 


\pg0779 


ORF 1 


100 


99.8 


99.1 


99.8 


99.8 


98.9 


98.9 


100 


98.9 


99.8 


99.4 


99.8 


99.8 



# Monoclonal antibody subgroup according to the 'Dresden' panel. 

* Determined by UPGMA clustering method based on multiple sequence alignment. 

A ORF 13 (Ipg0764/lpg0764b/lpg0763) of Philadelphia 1 not displayed, ORF 13-A of strain Lens was used. 

a Partial duplication of ORF 2 {Ipg0778). 

b< c Transposase; transposase disrupted. 

+ Lag-1 of Gorlitz 6543 has no functional start codon. 

Underlined numbers indicate different clusters of corresponding ORF (see also Figure 2). 

The highly conserved 15 kb region (ORF14 - ORF 28) is not completely shown and only reflected by WecA and GalE. 
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and a broad number of non-Sgl strains carry ORF 28 
[35,43,44] it can be assumed that CMP-Leg5Am7Ac is a 
common residue of the L. pneumophila LPS molecule 
which might subsequently become modified in a mAb- 
subgroup or even strain specific manner. 

Three clusters of the O-acetyltransferase Lag-1 

A well examined phenotype variation is linked to the 
presence and absence of the lag-1 gene. Lag-1 encodes 
for an O-acetyltransferase that conferred reactivity with 
mAb 3/1 and is exclusively found in Sgl strains. Our re- 
sults revealed three clusters of the lag-1 genes, although 
without any detectable relation to the mAb-subgroup 
switch which supports recent findings [45] (Figure 2A). 
Lag-1 was previously reported to be involved in mAb- 
subgroup switches of different strains. However, this was 
generally due to gene deletion or loss-of-function muta- 
tions of lag-1 [46-49]. Complete and functional lag-1 
genes were present in all mAb 3/l + strains and were 
absent in all mAb 3/1" strains. Besides that, the 
Philadelphia subgroup strains (Philadelphia 1 and Paris) 
as well as the Knoxville-subgroup strain Uppsala 3 car- 
ried a transposase and a partial duplication of ORF 2 ad- 
jacent to lag-1. Bernander et al. reported the region 
from ORF 2 to ORF 3 as unstable [46]. Looping out of 
the intermediate located lag-1 gene is assumed to be a 
potential consequence. Under in vitro conditions the 



deletion of the lag-1 gene occurred at with frequency of 
10" 6 to 10" 7 (C. Luck, unpublished results). Detailed 
analysis of the region from ORF 2 to ORF 3 including 
lag-1 of these strains revealed remarkably high similar- 
ities of Uppsala 3 to the Philadelphia-subgroup strains 
Philadelphia 1 and Paris (>98-100%) whereas the 
remaining Knoxville-subgroup strains clustered in a dif- 
ferent group (Table 3; Figure 2A). The high similarity of 
this 4 kb region between strain Uppsala 3 and the strains 
Paris and Philadelphia 1 may indicate horizontal gene 
transfer of this region. However, this had no impact on 
the specific mAb reactivity for all other analyzed 
Knoxville-subgroup strains. Horizontal gene transfer be- 
tween strain Paris and Philadelphia 1 was recently 
reported for a large genome fragment which also har- 
bored the LPS biosynthesis locus [32]. These observa- 
tions are confirmed by our results since the LPS 
biosynthesis loci of both strains are almost identical in 
composition, arrangement and amino acid similarity 
(Additional file 1: Table S2). A truncated lag-1 gene was 
found in the strain Gorlitz 6543 (mAb-subgroup Belling- 
ham) as recently reported [49]. The whole gene is 
present but carries a mutated start codon. Since Gorlitz 
6543 showed no reactivity with mAb 3/1 it was assumed 
that the mutation significantly impairs the production of 
a functional O-acetyltransferase. Phylogenetic analysis 
showed 99.9% amino acid similarity of Gorlitz 6543 to 



m cp 


c 

<J> § o> i 


A: lag-1 




96.8 


32 


1.9 






3 : 




52 


o o o 

T , . .? , ? 


c 

o o o c 

, .T , f ..1 


B: ORF 6 


1 

9-5 5 




60, 




99 2 


60 6 



Paris (Philadelphia) 
Philadelphia 1 (Philadelphia) 
Uppsala 3 (Knoxville) 



Lorraine (Allentown) 
L10/23 (Knoxville) 
Alcoy 2300/99 (Knoxville) 

130b (Benidorm) 
Lens (Benidorm) 
Corby (Knoxville) 
Gorlitz 6543 (Bellingham)t 



HL 06041035 (Bellingham) 
Gorlitz 6543 (Bellingham) 
Philadelphia 1 (Philadelphia) 
Paris (Philadelphia) 
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Figure 2 Dendrogram of variable ORFs. Multiple amino acid based cluster analysis using UPGMA (BioNumerics, Applied Maths NV, Belgium). 
The phylogenetic trees of gene log-] and of the ORFs 6, 7 and 8 are shown. ORF 9 is identical to the phylogenetic tree of ORF 8 and is therefore 
not shown. Similarity values and branch distances were depicted in percentages [%]. The strain-specific mAb-subgroup is indicated in brackets. 
The mutated start codon of lag-1 of Gorlitz 6543 was neglected for similarity analysis and is indicated with + . 
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Corby (mAb-subgroup Knoxville), 130b and Lens (both 
mAb-subgroup Benidorm) (Figure 2A). 

ABC-transporter genes wzt and wzm as Sgl-specfic 
marker region 

Noticeable conserved genes within the heterogenic re- 
gion were wzt (ORF 4) and wzm (ORF 5) which are 
almost identical among all analyzed Sgl strains 
(Figure 1A, Table 3). Wzm encodes for a protein 
containing a transmembrane domain while wzt encodes 
for a nucleotide binding domain of an ABC transporter 
system which mediates the O-antigen translocation 
across the inner membrane [50]. Recently, both genes 
were evaluated as marker genes for PCR based discrim- 
ination between L. pneumophila Sgl and non-Sgl strains 
[35]. The ABC transporter-dependent O-antigen path- 
way interacts with WecA (ORF 14), an UDP-GlcNAc-1- 
transferase which initiates O-chain biosynthesis at the 
cytoplasmic site of the inner membrane [50]. The low 
amino acid similarity of WecA between Sgl and non- 
Sgl that was described recently combined with the ab- 
sence of wzm and wzt in non-Sgl genomes [35] indicate 
a different O-chain biosynthesis mechanism for non-Sgl 
strains than found in Sgl strains. 

ORF 6 through 1 1 involved in O-antigen modification 

The most variable region within the Sgl -specific region 
in terms of low similarities on the amino acid level and 
the diverse arrangement of single ORFs was found from 
ORF 6 to ORF 11. The strains of mAb-subgroup 
Benidorm 130b and Lens were almost identical regard- 
ing the amino acid similarities of the single ORFs within 
the Sgl -specific region. Interestingly, strain 130b carried 
a large inverted fragment containing ORF 7 to ORF 11 
(Figure 1A). This region was surrounded by transposases 
suggesting their potential contribution to the inversion. 
Since the strain 130b showed no altered reactivity 
pattern using the Dresden panel compared to other 
Benidorm strains it could be stated that the inversion 
had no detectable effect on the LPS phenotype detected 
by monoclonal antibodies. 

The adjacent ORF 6 showed a high degree of variabil- 
ity between L. pneumophila Sgl strains represented 
by two clusters of low amino acid similarity (<39%) 
(Figure 2B). Interestingly, the two analyzed strains of the 
mAb-subgroup Benidorm, 130b and Lens, cluster into 
two distinct groups. This either indicates that the prod- 
uct of ORF 6 has probably no effect on the LPS struc- 
ture of strains of the same monoclonal subgroup or that 
it has the same function despite low similarity. 

However, ORF 6 products might be involved in the es- 
tablishment of a mAb-subgroup discriminating epitope. 
More precisely, only the mAb-subgroups Heysham and 
Knoxville react with mAb 3. This indicates a similar 



epitope which in turn could possibly be traced back to 
specific ORFs within the Sgl -specific region. However, 
strains of both mAb-subgroups were highly homologous 
regarding the whole LPS -biosynthesis with the exception 
of lag-1 which is present in Knoxville strains. (Figure 2B, 
Table 3). In addition, the strain Camperdown 1, not 
reacting with mAb 3, carried a very similar LPS- 
biosynthesis locus as Heysham 1 and the Knoxville 
strains. However, it is the single ORF 6 in which 
Camperdown 1 clusters differently to Heysham 1. It can 
be assumed that the combination of ORF 6 to 9 which is 
exclusively found in Knoxville and Heysham strains 
leads to reactivity with mAb 3. Another ORF 6 as found 
in the genetically very similar strain Camperdown 1 
could alter the LPS epitope and is thereby not recog- 
nized by mAb 3. Furthermore, the mAb 3 epitope was 
not influenced by O-acetylation of the legionaminic acid 
residue since the Knoxville strains were mAb 3/l + and 
carried the lag-1 gene whereas the strain Heysham 1 is 
negative for both markers. 

Modification of legionaminic acid in transposon mutants 

Two additional ORFs, ORF 8 and ORF 9, within in the 
highly variable region from ORF 6 to ORF 11 are most 
likely involved in O-antigen modification. The genetic 
nature of the ORF 8 products displayed two different 
clusters which was comparable to the clustering of ORF 
9. Both clusters share poor amino acid similarities of 
31% (ORF 8) and 30.7% (ORF 9) (Table 3, Figure 2D). 
These differences in amino acid similarity were also 
reflected by the ORF orientation. Both ORFs were orien- 
tated into opposite directions in strains of the mAb- 
subgroups Knoxville, Camperdown and Heysham which 
form a separate cluster in both ORFs (Figure 1A). For 
the remaining mAb-subgroups (Philadelphia, Allentown, 
Benidorm, Bellingham and OLDA) the ORFs are ori- 
ented into identical directions. In silico analysis of these 
loci predicted a five-gene operon from ORF 8 to ORF 12 
suggesting a coupled functional entity [51]. These strains 
were also grouped into a single cluster. However, recent 
transcriptomic data obtained from strain Paris revealed a 
four-gene operon which lacks ORF 8 [42]. For all strains 
regardless of the distance in the phylogenetic tree 
BLASTP predicted a methyltransferase function for ORF 
8 [48,52] and a siliac acid synthetase function (neuB 
family) for ORF 9 [21]. 

It is reported that the putative methyltransferase 
encoded by ORF 8 is responsible for A/-methylation of 
the 5-acetimydyol amino group of the legionaminic acid 
[48,52]. This is supported by studies on the legionaminic 
acid pathway of Campylobacter. The ptmH gene 
(Q1325) of C. jejuni is a homologue of ORF 8 of the 
Knoxville, Camperdown and Heysham subgroup cluster 
(Figure 2D) [40]. The ptmH product catalyzes the 
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modification of CMP-Leg5Am7Ac to the N- methylated 
residue CMP-5-acetimidoyl (Af-methyl) amino-7-acetami 
do-3,5,7,9-tetradeoxynon-2-ulosonic acid (CMP-Leg5Am 
NMe7Ac), the main residue of the Sgl O-antigen. 
Disruption of ORF 8 in the Bellingham-subgroup strain 
Gorlitz 6543 led to loss-of-reactivity with the 
Bellingham-subgroup specific mAb 10/6 and mAb 20/1 
and resulted in a mAb-subgroup switch from subgroup 
Bellingham to Camperdown. In similar mutants of the 
mAb 3/l + strain 130b the reactivity with mAb 20/1 
was also lost when ORF 8 or ORF 11 was disrupted 
leading to a switch from mAb- subgroup Benidorm to 
Allentown. The wild type strains 130b and these mu- 
tants did not react with mAb 10/6. This supported the 
assumption that the mAb 3/1 -specific epitope generated 
by the O-acetyltransferase Lag-1 masks the Af-methyl 
group and hinders binding of mAb 10/6 [48]. This is in 
agreement with earlier observations which reported 
a correlation between ORF 8 and Af-methylated legiona 
minic acid residues for the mAb 3/1" strain RC1 [52]. 
However, the fact that mutants of both strains, 130b and 
Gorlitz 6543, lost the reactivity with mAb 20/1, indicated 
that ORF 8 and/or ORF 11 are also involved in the gen- 
eration or modification of another epitope which is not 
blocked by the O-acetyl group. 

To find putative ORF candidates, next to ORF 8, that 
are responsible for synthesis or modification of the com- 
mon epitope bound by mAb 20/1, we looked for similar 
but unique ORFs within the Sgl -specific region of 
Bellingham- and Benidorm-subgroup strains. Phylogen- 
etic analyses identified ORF 7 as a putative subgroup dis- 
criminating gene since the mAb-subgroups Benidorm and 
Bellingham clustered in specific separate group when 
compared to the other mAb-subgroups (Figure 2C). The 
presence of two different ORF 7 variants is in agreement 
with recent results obtained by subgroup specific PCR 
amplification [49]. 

Conclusions 

Characterization of the LPS -biosynthesis loci of 
L. pneumophila Sgl strains revealed two mayor regions: 
A Sgl -specific region of 18 kb and a conserved 15 kb re- 
gion containing genes found in Sgl and non-Sgl strains. 
The conserved region carries genes involved in outer 
core and O-chain biosynthesis of LPS molecules. 

The variable and heterogeneous Sgl -specific region 
raised questions concerning the genetic basis for 
subgroup specific mAb-reactivity. Switches from one 
monoclonal subtype to another in transposon induced 
mutants gave a first indication for the function of differ- 
ent gene products. The most variable region from ORF 
6 to ORF 11 is likely involved in O-antigen modifications 
and could deliver more insight in the strain specific LPS 
structures and more important, in strain specific 



virulence. The ORFs within this region could act in a 
pathway-like manner explaining the broad variability of 
the LPS molecule among the Sgl strains. Furthermore, it 
is also not excluded that each ORF of this region has an 
own function in the late modification of legionaminic 
acid derivates which could be regulated in a life cycle or 
growth phase-depended way. Further studies using spe- 
cific mutation in these ORFs, mRNA assays and chem- 
ical analysis are required in order to elucidate the role of 
different genes in the synthesis of the subgroup specific 
structures in different strains. 

Methods 

Phenotypic and genotypic characterization of L. 
pneumophila strains 

Legionella pneumophila Sgl strains Camperdown 1 
(ATCC 43113), Heysham 1 (ATCC 43107) [23], 
Uppsala 3 [46] and Gorlitz 6543 [49] were grown on 
buffered charcoal yeast extract (BCYE) agar plates (Oxoid, 
Germany) for 48 hr at 37°C under a 5% C0 2 atmosphere. 
Monoclonal subgrouping was accomplished using the 
Dresden panel of mAb as described elsewhere [13,16]. 

DNA extraction and sequence generation 

DNA was extracted using the EZ1 DNA Tissue Kit 
(Qiagen, Germany). Prior to sequencing DNA fragments 
of the LPS -biosynthesis locus were PCR-amplified using 
GoTaq polymerase (Promega, US-WI) and LPS -specific 
primers (Additional file 2: Table SI) which were 
designed based on published L. pneumophila genomes. 
Initial denaturation was carried out at 95°C for 2 min 
followed by 30-35 cycles: 95°C denaturation for 30 s, 
annealing at various temperatures for 1 min and elong- 
ation at 72°C for 1 min/kb. Final elongation for 5 min at 
72°C completed the amplification protocol. The PCR re- 
sult was checked on 1.5% agarose gel with 5 V/cm (LE 
Agarose, Biozym, Germany) and purified (MSB Spin 
PCRapace, Invitek, Germany) for sequence reaction. 

Sequencing reactions were accomplished by a cycle- 
sequencing procedure on an automated DNA sequencing 
machine (ABI Prism 377, Applied Biosystems, US-CA). 

The LPS -biosynthesis locus of the strain L 10/23 was 
sequenced during a whole genome sequencing project. 
This strain was isolated during a cooling tower related 
outbreak in Ulm (Germany) in 2010 [53]. 

Sequence annotation and analysis 

Obtained sequences of Camperdown 1, Heysham 1, 
Uppsala 3, Gorlitz 6543 and L10/23 were assembled 
using SeqMan (DNASTAR Lasergene 8, US-WI) and 
controlled against public databases using BLAST [54]. 
ORF annotation of all analyzed strains was accomplished 
with GeneMark.hmm [55] and Artemis [56]. The anno- 
tation and the numbering of ORFs was based on the 
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initially described LPS -biosynthesis locus of strain RC1 
(mAb-subgroup OLDA) [21] and if possible supplemen- 
ted by further description of genes, gene products and 
their putative functions using BLAST, BLASTP [54,57], 
the LegionellaScope database of the MicroScope Micro- 
bial Genome Annotation Platform [58] and the Con- 
served Domain Database [59]. Since Luneberg et al. 
analyzed the strain RC1 which had 30 ORFs the num- 
bering of ORFs in other L. pneumophila Sgl strains with 
deviating ORF numbers is not continual [21]. The genes 
iraA (ORF 29) and iraB (ORF 30) were not taken into 
account as part of the LPS -biosynthesis locus. Both 
formed a small 2-gene operon responsible for iron as- 
similation, infection and virulence [60] . 

The putative coding regions were compared to already 
known LPS -biosynthesis ORFs of published L pneumophila 
strains using the SeqMan program. The LPS- 
biosynthesis clusters of the strains were deposited in the 
EMBL database under the number [EMBL: HE980447] 
for strain Camperdown 1 (mAb-subgroup Camperdo 
wn), [EMBL: HE980446] for strain Heysham 1 (mAb- 
subgroup Heysham), [EMBL: HE980445] for strain 
Uppsala 3 (mAb-subgroup Knoxville), [EMBL: HF678227] 
for strain Gorlitz 6543 (mAb-subgroup Bellingham) and 
[EMBL: HF545881] for strain L10/23 (mAb-subgroup 
Knoxville) (Table 2). 

Sequence homologies of single ORFs were calculated 
based on multiple alignments using BioNumerics 6.0 
(Applied Maths NV, Belgium) and BLASTP [57]. Cluster 
analysis was performed using the UPGMA method of 
the BioNumerics 6.0 software package. 

The sequences of other LPS -biosynthesis loci were 
obtained from complete genomes of the following 
strains: Paris (mAb-subgroup Philadelphia) (GenBank: 
NC_006368.1), Lens (mAb-subgroup Benidorm) (Gen 
Bank: NC_006369.1), Philadelphia 1 (mAb-subgroup 
Philadelphia) (GenBank: NC_002942.5), Alcoy 2300/99 
(mAb-subgroup Knoxville) (GenBank: NC014125.1), 
Corby (mAb-subgroup Knoxville) (GenBank: NC_0094 
94.2), Lorraine (mAb-subgroup Allentown) (EMBL: 
FQ958210), HL 06041035 (mAb-subgroup Bellingham) 
(EMBL: FQ958211), RC1 (mAb-subgroup OLDA) 
(EMBL: AJ277755) and 130b (mAb-subgroup Benidorm) 
(EMBL: FR687201.1) (Table 2) [21,28,29,31-34]. Since 
the genome of 130b is a draft version we closed a se- 
quencing gap in scaffold 4 (position 918107 to 918206) 
using PCR and sequencing. 

Availability of supporting data 

The data sets supporting the results of this article are 
available in the Lab Archives repository, DOI: 10.6070/ 
H4WM1BBQ. It includes a list of all primers used for 
ORF amplification and sequence generation (Additional 
file 2: Table SI), a spreadsheet containing detailed 



information about the LPS -biosynthesis locus such as 
ORF identifier, ORF size and putative size of the trans- 
lated ORF product (Additional file 1: Table S2) as well 
as the % GC content of the ORFs of the Sgl-specific 
region (Additional file 1: Table S3). 
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